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Abstract 

■ We study the the following question in Random Graphs. We are given two disjoint sets L, R 

with \L\ = n = am and \R\ = m. We construct a random graph G by allowing each x £ L to 
choose d random neighbours in R. The question discussed is as to the size //(G) of the largest 
£j \ matching in G. When considered in the context of Cuckoo Hashing, one key question is as to 

when is //(G) = n whp? We answer this question exactly when d is at least three. We also 
establish a precise threshold for when Phase 1 of the Karp-Sipser Greedy matching algorithm 
suffices to compute a maximum matching whp. 

in 

CO 

i/-) ■ 1 Introduction 

d , 

For a graph G we let //(G) denote the size of the maximum matching in G. In essence this paper 
provides an analysis of //(G) in the following model of a random bipartite graph. We have two 
disjoint sets L,R where L = [n},R = [m] where n = am. Each v G L independently chooses d 
random vertices of R as neighbours. Our assumptions are that a > 0, d > 3 are fixed and n — > oo. 
One motivation for this study comes from Cuckoo Hashing. 



CO 



X 



Briefly each one of n items x £ L has d possible locations h\{x), h,2(x), . . . , hd(x) € R, where 
d is typically a small constant and the hi are hash functions, typically assumed to behave as 
independent fully random hash functions. (See [21] for some justification of this assumption.) 
We are thus led to consider the bipartite graph G which has vertex set L U R and edge set 
{(x, hj(x)) : x £ L, j = 1, , 2, . . . , d}. Under the assumption that the hash functions are completely 
random we see that G has the same distribution as the random graph defined in the previous 
paragraph. 

We assume each location can hold only one item. When an item x is inserted into the table, it 
can be placed immediately if one of its d locations is currently empty. If not, one of the items in 
its d locations must be displaced and moved to another of its d choices to make room for x. This 
item in turn may need to displace another item out of one its d locations. Inserting an item may 
require a sequence of moves, each maintaining the invariant that each item remains in one of its d 
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potential locations, until no further evictions are needed. Thus having inserted k items, we have 
constructed a matching M of size k in G. Adding a (k + l)'th item is tantamount to constructing 
an augmenting path with repsect to M. All n items will be insertable in this way iff G contains a 
matching of size n. 

The case of d = 2 choices is notably different from that for other values of d and the theory for the 
case where there are d = 2 bucket choices for each item is well understood at this point [9j EOJ [22] . 
We will therefore assume that d > 3. 

We will now revert to the abstract question posed in first paragraph of the paper. 



2 Definitions and Results 



This question was studied to some extent by Fotakis, Pagh, Sanders and Spirakis [15J. They show 
in the course of their analysis of Cuckoo hashing that the following holds: 

Lemma 1 Suppose that < e < 1 and d > 2(1 + e)log(e/e). Suppose also that m = (1 + e)n. 
Then whp G contains a matching of size n i.e. a matching of L into R. 



□ 

In particular, if d = 3 and m ~ 1.57n then Lemma Q] shows that there is a matching of L into R 
whp. 

This lemma is not tight and recently Mitzenmacher et al [12] observed a connection with a result 
of Dubois and Mandler on Random 3-XORSAT [10] that enables one to essentially answer the 
question as to when fJ,(G) > n for the case d = 3. More recently, Fountoulakis and Panagiotou [11] 
have established thresholds for when there is a matching of L into R whp, for all d > 3. 

We begin with a simple observation that is the basis of the Karp-Sipser Algorithm [16\ [2]. If v is a 
vertex of degree one in G and e is its unique incident edge, then there exists a maximum matching 
of G that includes e. Karp and Sipser exploited this via a simple greedy algorithm: 



Algorithm 1 Karp-Sipser Algorithm 



procedure KSGreedy(G) 
M <- 

while T / do 

if r has vertices of degree one then 

Select a vertex £ uniformly at random from the set of vertices of degree one 

Let e = (£, r\) be the edge incident to £ 
else 

Select an edge e = (v, u) uniformly at random 
end if 

M^MU {e} 

r-r\{£,ry} 

end while 
return M 
end procedure 



2 



Phase 1 of the Karp-Sipser Algorithm ends and Phase 2 begins when the graph remaining has 
minimum degree at least two. So if T± denotes the graph T remaining at the end of Phase 1 and 
r\ is the number of iterations involved in Phase 1 then 

M(G)=TL +MTi). (1) 

Our approach to estimating (J,(G) is to (i) obtain an asymptotic expression for t\ that holds whp 
and then (ii) show that whp T\ has a (near) perfect matching and then apply ([T|). 

We summarise our results as follows: Let z\ satisfy 

e Zl - 1 

* = — < 2 > 

and let 

01 = d(l-e-^ )«*-!• (3) 



Theorem 2 If a < ct\ then whp fj,(G) =T\ = n. 

Thus whp Phase 1 of the Karp-Sipser Algorithm finds a (near) maximum matching if a < a\. In 
particular, if d = 3 then z\ ~ 1.251 and ct\ ~ .818 and thus m ~ 1.222n is enough for a matching 
of L into ii!. 

Andrea Montanari has pointed out that our proof of Theorem [2] via the differential equations 
method is not new and already appears in Luby, Mitzenmacher, Shokrollahi and Spielman [13] and 
also in Dembo and Montanari [8]. We will prune this from the final version of the paper, but leave 
it in here for now. 

Now consider larger a. Let z* be the largest non-negative solution to 

i 

Z \ d-l 



— ) +e~ z -l = 0. 
ad/ 



Theorem 3 If a > a\ then whp 
(a) z* > 0. 



(b) n ~ rt 1 • 1 



_d 
I d 

ad j 



(c) Ifd>3 then 

/i(ri) = min{|Li[, \Ri\} = minjn - t%, (1 - (1 + z*)e~ z *)m + o(m)| . (4) 

Here L\ C L,i?i C are t/ie too sides o/ the bipartition ofT\, after deleting any isolated 
vertices from the R-side. 
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3 Structure of the paper 



We first prove Theorem [2j This involves studying Phase 1 of the Karp-Sipser Algorithm. For this 
we first describe the distribution of the graph G. This is done in Section SJ The distribution of T 
is determined by a few parameters and these evolve as a Markov chain. To study this chain, we 
introduce and solve a set of differential equations. This is done in Section [5j We show that the 
chains trajectory and the solution to the equations are close. By analysing the equations we can 
tell when Phase 1 is sufficient to solve the problem. This is done in Section [6J If Phase 1 is not 
sufficient then the graph T\ that remains has degree d on the L-side and minmum degree at least 
two on the .R-side. We show that whp T± has a matching of size equal to the minimum set size of 
the partition. [12] and [9] and [TTj . 

4 Probability Model for Phase 1 

We will represent G and more generally T by a random sequence x G Ql,r = {R d U {*} d ) L ■ A 
sequence x G £Il,R is to be viewed as n subsequences xi, X2, . . . , x n where Xj = (xj t %, Xj^, ■ ■ ■ , Xj t d) G 
R d or Xj = a = (*,*,...,*). The *'s represent edges that have been deleted by the Karp-Sipser 
algorithm. For x G £Il,r we define the bipartite (multi-)graph T x as follows: Its vertex set consists 
of a bipartition L x = {j G L : Xj ^ a} (the left side) and R (the right side). The edges incident 
with j G L x are (j,Xj^), i G [d] i.e. we read the sequence x from left to write and add edges to T x 
in blocks of size d. Each block being assigned to a unique vertex of L x . 

We should be clear now that our probability space is £Il,r with uniform measure and not Ql,r = 
{r x : x G Ql,r}- 

Given a graph r x we let Vj = Vj(x) = |J2j-(x)| where Rj(x) is the set of vertices in R that have 
degree j > 0. We let v = v(x) = \R X \ where R x = Uj>2 ^'( x )- 

For the graph G we choose x(0) uniformly at random from {R d ) L and put G = r x ( v Next let 
r(0) = G and let T(t) = T x m be the graph T that we have after t steps of Phase 1 of the Karp- 
Sipser algorithm. The sequence x(t) is defined as follows: Observe first that the vertex £ of degree 
one is always in R. Suppose that it is incident to the unique edge (?],£), rj G L. Then we simply 
replace x^ in x(t — 1) by a to obtain x(t). We should thus think of the Karp-Sipser Algorithm as 
acting on sequences x and not on graphs. We write x — » y to mean that y can be obtained from 
x by a single Phase 1 step of the Karp-Sipser Algorithm. 

Let v(t) = (w(t),vi(t),v(t)) where w(t) = \L x a\\ is the number of vertices on the left side of the 
bipartition of T(t). Assuming that we have only run the Karp-Sipser algorithm up to the end of 
Phase 1, we have w(t) = n — t. Also 

v{0) ~ (n, ade- ad m, (1 - e~ ad - ade- ad )m) whp. (5) 

We will omit the parameter t from v(t) when it is clear from the context. Let X$ be the set of all 
x 6 ^l,r with parameters v. 

Lemma 4 Suppose x(0) is a random member of X$(n\. Then given v(0), . . . ,v{t), x(t) is a random 
member of X^m f or all t >0. 
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Proof: We prove this by induction on t. It is true for t = by assumption and so assume it is true 
for some t > 0. Let v(t) = (w, Vi,v) and now fix a triple if = (w' = w — 1, v[,v') as a possible value 
for v(t + 1). Fix y € ^"(w'y ,«') • We first compute the number of x S Xjf(t) sucn that x — > y. Let 
b = v — v' be the number of vertices in i? x \ R y . Some of these will be in Ro(y) and some will be 
in .Ri(y). So we choose non-negative integers bo, b\ such that bo + b\ = b. Next let a = v% — v[ + &i 
be the number of vertices in i?i(x) n -Ro(y)- We can choose a vertex r] so that L x \ L y = {n} in t 
ways and now let us enumerate the ways of choosing x^ = (Cl> C2> • • • ; Cd)- Om choices for Q are 
(i) distinctly from Ro(y) (i-e. Ci is distinct from rest of the Q), (ii) non-distinctly from i?o(y) (i.e. 
£ = is chosen more than once in the construction), (iii) from i?i(y) and (iv) from i? y . We must 
exercise choice (i) exactly a times, choice (ii) at least twice for each of bo distinct values, choice (iii) 
at least once for each of b\ distinct values and choice (iv) the remaining times. 

The number of choices for x^ depends only on v and if, i.e. for each y E X$i we have that 
D(v, if) = [{x £^j:x-> y}| is independent of y, given v and if. 

Similarly given x there is a unique i £ R\(x), which when removed determines y. Thus N(v) = 
|{y : x — > y}| is fixed given v. Thus if x(i) is a random member of X$ then 

P(x(t + l)=y|tf(0),...,tf(t)) 

= Yl P(x(t)=x|t?(0),... ) ^t))-P(x(t + l) = y|ir(0) J ...,^t-l),x(i)=x) 

x G"*V(t) 

= Yl P(x(t + l)=y|x(t)=x)-|A' t;(t) |~ 1 

x G"*V(t) 

^ D(v(t),v(t + 1)) , ._! 

which is independent of y given v(t) and so y is a random member of Xfia+i) ■ D 
Lemma 5 The random sequence v(t),t = 0, 1, 2, . . . is a Markov chain. 

Proof: As in [2], 

P(v(t + l)\v(0),v(l),...,v(t)) = P(x'K(0)^(l),...,^)) 

= E E p(x',x|^(o),^(i),...,^)) 
= E E p^i^^),-,^-!)^) 

xP(x|t;(0),i;(l),..., <;(*)) 

= E E p(x'\x)\xv(t)\-\ 

which depends only on v(t),v(t + 1). □ 

Lemma 6 Conditional on v if x zs selected uniformly at random from X$ then each vertex i 6 i? x 
/tas degree Yi where Y{ = Poi(z; > 2), a Poisson random variable conditioned to take a value at least 
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two, and z satisifes 



z(e z — 1) dw — v\ 



(6) 



/(*) 



where f(z) = e z — z — 1. 

The Yi are also conditioned to satisfy Xa=i ^ = wc ^ ~ u i • 

Proof: Suppose we first fix the edges incident with vertices of degree one in x. Then we randomly 
fill in the remaining dw — v\ non-* positions in x with values from some fixed v-subset i? x of R, 
subject to each of these v vertices having degree at least two. The degrees Yi of these vertices will 
have the description described in the lemma (for a proof see Lemma 4 of [2]). □ 

From [2] we can use the following lemma 
Lemma 7 [2] 

(a) Assume that logn = 0{{vz)z). For every j G and 2 < k < logn, 



5 Differential Equations 

Let v be the current parameter tuple and v* be the tuple after one step of the Karp-Sipser algorithm. 
The following lemma gives E[v' — v\v] for each step of Phase 1 of the Karp-Sipser algorithm. 

Lemma 8 Assuming logv = 0({vz)z) and v\ > we have 



Proof: First note that v\ > 0, one vertex £ € R with deg(£) = 1 will be picked and £ and its 
neighbor r] G L will be removed from Gt- This implies that w decreases by 1 and the number of 
edges removed is d, i.e. all edges incident to n. Let 5 be the number of multiple edges incident to 
rj. Then we have 




(7) 



(b) For all k > 2,j G R x 




□ 




E[5\v\ < d 



P(rj is incident to 
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where Y has distribution (|7|). 

Explanation: The dw choices of neighbours for the remaining vertices in L form a list with v\ 
unique names and wd — v\ non-unique names and where the number of times a vertex appears 
among the wd — v\ has distribution (|7|). Also, if we construct this list vertex by vertex, it will 
appear in a random order. So the probability that j appears in two of the choices for n is bounded 
by E[ gg 2] and this justifies ©. 

The change in v% comes from £ being removed, minus the number of other degree one vertices 
adjacent to r/ and plus the number of vertices adjacent to rj of degree exactly two. Any change from 
vertices of degree three or more is absorbed by the O (—) term for multiple edges. 

The expected change is then 

d-1 , \ 2(d-l)_. . . „ f 1\ 

E[v[ - Vl v] = -1 - -(«! - 1 + Mv 2 v] + O - 

dw — 1 dw — 1 VW 

i d-1 d - 1 z 2 ^ / 1 \ . . 

= -\--r— vi + -r- -jV + O — (9) 

Similarly for v, the change is only due to vertices adjacent to y of degree exactly two, modulo 
multiple edges. Thus 

i-r / n d - 1 Z 2 „ / 1 

E «' - v\v] = -v + O — 



□ 



Lemma [8] suggests that we consider the following pair of differential equations 



dyi d-1 d-1 y( 2 , . 

^ = - 1 -^T yi + ^T7(C) (10) 



dy d-1 yC 



2 



dt dw /(C) 

where w = n — t and £ satisfies 

C(e^ - 1) du> - yi 



(11) 
(12) 



/(C) y 

The boundary conditions are (see (|5])) 

C(0) = ad, yi(0) = made~ ad , y(0) = m(l - (1 + ad)e~ ad ). (13) 
The yi, y, C are of course the deterministic counterparts of v±,v,z respectively. 
Lemma 9 The solution to (fTOj) . (fTTj) and (fl3j) is 



to = ( -i- ) n. (14) 




* = "{'-{td) ) (15) 
y = e- ? /(C)m (16) 

2/1 = mC ( f^V" 1 +e" C - 1 J (17) 
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Proof: We take the derivative of ([6]) with respect to t. The RHS becomes, using (jlOp and (QT 
d (dw-y{\ If d _ /_ 1 _ d-l yx + d - I y ( 2 \ dw - y x 1 ( d - 1 y ( 2 



dt \ y ) y\ \ d w d wf(()J y y\ d w /(C) 

d—lf dw — yi ( 2 dw — y\ C 2 



dw V y /(C) y /(C) 
d-i /C(e c -i)/(C) + C 2 /(C)-CV-i) 



dw V /(C) 5 

d-l /C(e C -l) 2 -C 3 e C 



dw V /(C) 2 

On the other hand, on differentiating the LHS of ([6]) (with z replaced by C) we get 

d fC(e<-l)\ _ (Ce^ + e^-l)/(C)-C(e C -l) 2 dC 



dt V /(C) 7 /(C) 2 dt 

(e< - l) 2 - CV dC 



Comparing ([18]) and (fT9l) we see that 



Integrating yields 



Plugging in C(0) = ad we see that 



/(C) 2 dt 

1 dC __ d-l 
C dt dw 

constant 



w d-i 



( d (ad) c 



w d 1 n d 1 



This verifies JUj) and (jT5j) . 

Going back to (fTTj) and ([20]) we have 

dy dC (d - l)y C 2 1 dC yC 2 



d(dt dt dw /(C) (dtf(() 

So 

Idy C 



ydC /(C) 

and integrating yields 

lny = -C + ln/(C) + C 
Taking y(0) = m(l — (1 + ad)e~ ad ) gives C = mm and we see that (I16p holds. 



(18) 



(19) 



(20) 



We now solve for y\ in terms of C as a function of d. It follows from (j 12 j) and (|16p that 

yi = dw + Ce~^m — mC 



) nd + (e — mC 



C N '' 
ad 
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□ 

At this point we wish to show that whp the sequence v(t),t > closely follows the trajectory 
y(t) = (w,y±,y),t > described in Lemma [H One possibility is to use Theorem 5.1 of Wormald 
[24] . but there is a problem with an "unbounded" Lipschitz coefficient. One can allow for this in 
[23], but it is unsatisfactory to ask the reader to check this. We have decided to use an approach 
suggested in Bohman [4]. 

Next let K be a large positive constant and let 7 <C 1/K. Then let 

g(x) = (1 - X y K + K(l - x)' 1 

and 

Err(t) = n 2/3 g(t/n). 

Then define the event 

S(t) = |f (r)z(r) > n 1/2 and ((t) > ((h) + n" 7 and \v(t) - j^r)^ < 2Err(r) for r < t} 
where 

h = min {t > : y x (t) = 0} . (21) 
Now define four sequences of random variables: 



Xf{t) : 
X±(t) : 

Because (1 — x)~ L is convex we have 



\ vi(t) - yi (t) ± Err(t) S(t-1) holds 
1 X 1 (t — 1) otherwise 

( v(t) - y(t) ± Err(t) S(t - 1) holds 
I X^ft — 1) otherwise 



1 1 hL 

> TZ + 



(I - (x + h)) L ~ (1 - x) L (l-x) L+1 
for L > and 0<x<x + h<l. So, 



+ l)/n) - g{t/n) > — g(t/n). (22) 
n — t 



Suppose that £(t) holds. We write 

dw — y\ dw — v\ 



y 



(dw - yi)(v - y) vi - y x 



yv 

(Err(t)\ 
V v 



O [ —^j . (23) 



(For this we need (dw — y\)/y = O(l). But this follows from (|12|) and the fact that ( is decreasing 
- see (pOjl ). 

Putting F(x) = X ^j(~^ we have (see (fT9l) ) F'(x) = ^ ~jhc^ & an< ^ srnce 

00 k 
(e- _ 1)2 _ x 2 e , = ^ (2fc _ 2 _ k(k _ (24) 
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we see that F'(x) = 0(1) in any bounded interval [0, L]. 
Hence from (I23p we have 

( Err{t)\ 



O 



V v 



■) 



F(()-F(z)=fl(\(-z\) 



or 



| C - z | = o(^). 



Using (|25|) we obtain 



m /(o 



\v-y\z 2 

~ /(*) ^ 
< K x Err(t) 



m /(o 



for some = i^i(a, d) > 0. 
For the second term we use 

- 2 V 2x/(x) -x 2 (e x - 1) 



This implies that ^ jj- 
Now with / = /(C) 



/o*o; /(x) 2 

0(1) for x > 0. 



j CO 

?V\2 XT 



fc-3 



/(x) 2 ^ (fc _ 1}! 



du; \ it) to / cfat; 
0(C- Wn- x ). 
d-1 fyC 2 d-1 (yC 4 



y( 2 y( 4 



f P * s V/ P 



+ 



dw \wf ' dw \ P V / 



2/1 - — ^ - yC ( t 

2C C(e C - 1) 



2 - 1 



P 



= 0{C° W n- 1 ). 
If 5(f) holds then, where p t - d ~ l - 



dw d(n—t) ' 

E(X+(t + l)-X+(t)\v(t)) = 

E( Vl (t + 1) - Vl (t) | v(t)) - ( yi (t + 1) - yi (t)) + n 2 / 3 (<?((i + l)/n) - g(t/n)) > 



y"(t + 0) + 



ifn 2 / 3 
n — t 



(25) 



(26) 



5 (t/n) 



>^f., 1 _ 2 + if) + 



n — t 



1 



u(t)z(t) 



> 0. 



This shows that X^(t),i > is a sub-martingale. Also, 

|X+(i + l)-X+(i)| < 

|«i(t + l)-t;i(t)|+ sup \y' 1 {t + e)\+n 2 l\g{{t + l)/n)-g{t/n))=0{l). 
Q<e<\ 
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It follows from the Azuma-Hoeffding inequality that we can write 

P(31 <t<h: X+(t) < X+(0) - n 3/5 ) < e - fi (™ 1/5 ). 

By almost identical arguments we have 

P(31 <t<h: Xf (t) > Xf(0) + n 3/5 ) < e" Q(nl/5) . 
P(31 < t < h : X+{t) < X+(0) - n 3 / 5 ) < e " fi (" 1/5 ). 
P(31 <t<h:X-(t)> X-(0) + n 3/5 ) < e " n (" 1/5 ). 



It follows that whp, when £{t) holds, we have 

\vi(t) - Vl (t)\ < Err{t) + n 3/5 + |t>i(0) - yi(0)| < 2Err(t). (27) 
\v(t) - y(t)\ < Err(t) + n 3/5 + |«(0) - y(0)| < 2Err(t). (28) 

Now by construction, £(t) will fail at some time t 2 <t\. It follows from (|27p . (|28p that whp it will 
fail either because (i) v(t 2 )z(t 2 ) < re 1 / 2 or C(^) < C(*i) + n~" / . We claim the latter. Observe that if 
CO) > C(*l) + n ~ 7 then P^|) - P^P imply = n(n 1 ~ dr '^ d ~^) and y = f^n 1 " 27 ). Together with 

(p7D . (I2HD . this implies that v{t 2 )z{t 2 ) = ft^ 1 " 37 ) > n 1 / 2 . 



In summary then, whp the process satisfies 



^2/3 

(i ~t 2 jny 



W) - y{t)\oo < 2Err(t 2 ) = O ( „ '\ , ) = (n 2 / 3+Kd ^^) forl<t<t 2 (29) 



and 

Vl (t 2 ) = 0(n x ' 7 ) where t 2 = h + 0(n^ 7 ). (30) 
We use (EE1) for (HI and (ED, (I25D for ffl). 



6 Analysis of Phase 1 

We will first argue that whp Phase 1 is sufficient to find a matching from L to R when there is no 
solution <(< ad to 

i 

X)"+.-<-l_0. (31) 

It follows from (|17p . (I2ip and (I30p that in this case Phase 1 ends with there being at most 0(n 1_7 ) 
vertices of L left unmatched, whp. Furthermore at time t 2 we will have 

d 

( ( \ d ~ 1 4=2. 
w ~ — - n, v\ ~ dw and v = 0(( d - 1 vi) 
\ad J 

where ( = ((t 2 ) = 0(n" 7 ). 

Lemma 10 Suppose that t\ = 0. Then whp at time t 2 ,T is a forest. 
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Proof: Let R>2 denote the set of vertices of degree at least two in the -R-side of T. Let P(d±, . . . , d k ) 
denote P(X\ = d%, . . . , X k = d k ) where X\, . . . , X v are truncated Poisson conditioned only to sum 
to dw — v\. For large k we use the bound 



k j. 



P(X l = d h ...,X k = d k )< 0(n l l 2 ) J] jjj^. (32) 



For k = 0(1) and d%, . . . , d k = 0(log n) we write 

k k d 

P(Xx = d 1 ,...,X k = d k ) = H P(X t = di | Xj = dj,j < i) = (1 + o(l)) H (33) 

i=i i=i l ^\ z > 

It is equation (|7|) that alows us to write the final equality in (|33p . The extra conditioning Xj = 
dj,j < i only changes the required sum. 

Thus let B k denote 0{n 1 / 2 ) for k > an d 1 + o(l) otherwise. The expected number of cycles 

can be bounded by o(l) = (P(3 vertex of degree > logn)) plus 

EE E m.-,*)x s 

fe>2 5CK> 2 2<d 1 ,...,d fc <logn v 7 i=l v /v 1 

|S|=& 

(34) 



M) *-f „ LL 1 d l lf(z) \kj\2j { 11 (dw-2i + 2)(dw-2i + l)) 



Y, Bk {\ 1 ^ 11 ~ — K 1 ! 1 ! (/, ' !) ' 1 1 — — 

k>2 ^ ' 2<di,...,d fc <logni= 

VWZ d \ - - — T-" 1 z 



(/(z)(d W -2fc) 2 ) s n 

fc>2 VJ v /v ; 7 2<d 1 ,...,d k i 



E ( vwz 2 e z d 2 \ k _ 
kt2 k \f{z){dw-2kf) 

Ed-2 , 
B k O((—) k = 

k>2 
0(1). 

□ 

Explanation of ()34[) : We condition on the degree sequence. Having fixed the degree sequence, we 
swap to the configuration model [5j. Having chosen S C R and k vertices W in L and their degrees, 
we can work within tnis model. We then choose a cycle through these vertices in (k\) 2 ways. We 
then choose the configuration points associated with our /c-cycle in (d(d — l)) k Y\i=i d%(di — 1) ways. 
We then multiply by the probability Yli=i (dw-2i+2)(dw-2i+i) °^ choosing the pairings associated 
with the edges of the cycle. 

Corollary 11 Suppose that t\ = 0. Then whp at time ti, T contains a matching from L-p into 
Rr- Furthemore, such a matching will be constructed in Phase 1. 

Proof: We can assume from Lemma [10] that T is a forest. Each vertex of Lr has degree d and 
so Hall's theorem will show that the required matching exists. (Any Hall witness would induce a 
cycle). Finally note that Phase 1 of the Karp-Sipser algorithm is exact on a forest. □ 
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6.1 Threshold for Phase 1 to be sufficient 



Put A = {ad)- 1 /^-^ and B = l/(d - 1) so that d3TJ) can be written as 

A( B + e" c - 1 = 0. (35) 

Assume B is fixed. We find a threshold for A in terms of B for there to be no positive solution to 
(I35p . We find the place £* where the curve y = 1 — touches the curve y = A( B i.e. where 

A( B = !_ e -C 
AB( B - 1 = e~< 

In which case 

^ = e« - 1 and A* = -^p- (36) 
or ^ 

c * = and a * = dii-U-Y- 1 ' (37) 

In general, keeping B < 1 fixed let 

/ A (C) = A( B + e-f - 1 and L A = A" 1 ^. 

We must show that if ^4 > ^4* then the only solution to /a(C) = 0, < is C = 0. 

Observe that /a(La) = e _LA > and /a(0) = 0. Also, if A < 1 then 1 < L4 and if A < 1 - e _1 
then /a(1) < and there must be a positive solution to /a(C) = 0- 

Observe that A' > A implies (i) La> < La and that (ii) /a'(C) > /a(C) f° r C / 0. So if /a(C) nas 
no positive solution then neither has Ja>- We argue that f'AC) = has at most 2 solutions, which 
implies that Ja(C) = has at most two positive solutions. As we increase A to A* these solutions 
must converge, by (ii). 

Now 

But the function g(£) = is convex for any £ > 0. Indeed 

/(CH eC((c - { S 2+a >° 

and so 5(C) = a has at most two solutions for any a > 0. 
6.2 Finishing the proof of Theorem [2] 

We now have to relate the above results to the actual process. We know from our analysis of the 
differential equations that for some A > 0, 

v 1 {t 2 = t 1 + An^) = 0{n 1 -^). 

When £1 = C(ti) = 0, Lemma [TOl and Corollary 1111 imply Theorem [2J 
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So assume that £i > 0. Thus a > a* and A < A*. We argue that if za is the solution to (J35J) then 
za decreases monotonically with A. Indeed, ii A' > A then /a'(C) > Ia(C) f° r (a < C < Aa' < ^M- 
Now ^z^i is strictly monotone decreasing with z and so 

{d ~ l)ZA < {d ~ l)ZA * = 1. (38) 

e Z A - 1 e Z A * - 1 V ^ 

The second equation in ([38}) is the first equation in ([36]) . 



At time t 2 we will have v\ = 0(n x 7 )) and v = For the next o(n) steps we have from ([9]) 

that 

EK - vi I 0) = -! + (! + o{l)) d -^jV = -1 + (1 + ^"^f < -e (39) 

for some small positive e. In which case, whp, v\ will become zero in 0(n 1-7 log n) steps. Indeed 
([39}) implies that the sequence 



vi(t 2 + k) + ek if Vl (t 2 + k)>0 
Xk-i otherwise 



is a super martingale that cannot change by more than d in any step. The Azuma-Hoeffding 
inequality implies that for T = 2Xq je we have 

P(ui(t 2 + T) > 0) < P(X T -X >eT- X ) < exp |- ^"J^ } = o(l). 
I.e. whp vi(t2 + T) = and u(i2 + T) = f (^2) — o(n) = U(n). 



7 Proof of Theorem [3] 

Let us summarize what we have to prove. We have a random bipartite graph Ti with partition 
L\,Rx and \L\\ = n\ = aimi, = mi. Each vertex in Li has degree d and each vertex in R has 
degree at least 2. At this point it is convenient to drop the suffix 1. So from now on, m, n, a, T etc. 
refer to the graph left at the end of Phase 1. 

The degrees of T satisfy, di{a) = d for a G L. The degrees of vertices in R are distributed as 
the box occupancies X\, X 2 , . . . , X n in the following experiment. We throw dn balls randomly into 
n boxes and condition that each box gets at least two balls. In these circumstance the Xj's are 
independent truncated Poisson, subject to the condition that X\ + X 2 + • • • + X n = dn, see Lemma 
[6] with v\ = 0. Thus for any S = {61, b 2 , . . . , b s } C R and any set of positive integers ki > 2,i £ S 
we have 

k 

P(d Rl (k) = k t ,ieS)< Oin 1 ' 2 ) Yl 

ies l ' J ^ ' 

for k > 2 where z satisfies 

z(e z — 1) _ nd 
f(z) m' 

The Oiji 1 ! 2 ) term accounts for the conditioning ^2beR^ R ^ = ^ n ^ e P rove 
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Theorem 12 Let V be a bipartite graph chosen uniformly from the sets of graphs with bipartition 
L,R, \L\ = n, \R\ = m such that each vertex of L has degree d > 4 and each vertex of R has degree 
at least two. Then whp 

/x(r) = min {m, n} . 

7.1 Useful Lemmas 

Define the function ( (7), 7 > to be the unique solution to 

u(e u - 1) 
7H 

Let g be defined by 



7- 



<?(*) = (e«*> - lJ'/CCCx)) 1 - 

Observe that 



/(C(*)) = f40) 

C(a;) x x x 1 j 



Lemma 13 The function g{x) is log-concave as a function of x 

Proof: We will write ( for ((x) and / for f(Q throughout this proof. Now ^ e ^~^ = x from which 
we get 

dC f 2 



dx ( e C - l) 2 - C 2 e c 

da: 

d d 



(41) 



and note that 4| > for £ > 0. Taking the derivative of log(g(x)) we get 



— log(5(ar)) = — log(e c - 1) + (1 - x) log(e c - ( - 1)) 



n , - 1\ dC / e« - 1 

log (-rJ + dl^ic3T +(1 -^-r 



Now X = ^-j — - SO 



e< ^e ^ -l Ce C f - ((e< - 1) e< - 1 

_ Ce C( e C _ C _ 1) + (e C _ C _ 1 _ Ce C + C )( e C _ 1) 

p 

( e C _ 1)2 _ C 2 e C 
dx 

= dC 

Thus we have 

±\og(g(x))=log(^-j^j+l (42) 
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Taking the second derivative we get 



d 2 , , ,» d (. /e c - 1 , , 
log( 5 (x)) = — log — — + 1 



dx 2 dx \ \ f 

f e *( e C _ C _ i) _ ( e C _ i)2 d( 

3^ — 1 f 2 dx 

-(C"l)e C -l N 



1 d£ I „ ^ C 



(e< - 1)/ 

and since — (£ — l)e^ — 1 is strictly negative for ( > we get that g(x) is log-concave □ 
Lemma 14 is concave as a function of x. 

Proof: We begin with (|4"T|) . We note from ([24"]) that the denominator 

( e C _ 1)2 _ C 2 g C > 0. 

Then we have 

d 2 C _ 2(1 + Q + e< (-6 - e 2 ^ (2 + C(C - 4)) + C 2 (5 + C(C + 2)) + e^(6 - 2((2( + 3))) d( 
dx 2 ((eC - l) 2 - C 2 e9 2 dx' 

Now let 

oo 

<P(u) = ^ <f>nu n = 2(1 + u) + ^(n) 

n=0 

where 

oo 

^(u) = J2 ^ n u n = e"(-6 - e 2 "(2 + u(u - 4)) + u 2 {5 + u(u + 2)) + e u (6 - 2u(2u + 3))). 

n=0 

We check that ipo = —2 and ^1 = which implies that <j)Q = <f>\ = 0. One can finish the argument 
by checking that 

3 n-2/ n 2 _ 13n + i 8 ) + 2 n( n 2 + 2n _ 6) - (n 4 - 4n 3 + 10n 2 - 7n - 6) 

V>n = j < 

for n > 2. This is simply a matter of checking for small values until the 3 n term dominates. □ 
Next let 

/ e u — u — i ' 
ff(it) = log f(u) - u - 2 log u = log 5— 

Lemma 15 H(u) is convex as a function of u. 
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Proof: 



*_JI( \- d ( eU ~ l 1 2 
du 2 {) du\f(u) u 



_ e u (e u -l-u)-(e u - l) 2 _2_ 
f 2 (u) u 2 
_ e n - 1 - ue u 2 
f 2 {u) u 2 
u 2 (e u - 1 - ue u ) + 2(e u - 1 - u) 2 
u 2 f 2 (u) 

2e 2M + u 2 e" + tt 2 + 4u + 2 - u 3 e n - 4ue n - 4e n 
m 2 / 2 (m) 

Let 

oo 

<f>(u) = 2e 2 " + uV + n 2 + 4n + 2 - u 3 e" - 4-ue" - 4e" = ^ $ n u n . 

n=0 

Direct computation gives 4>o = 4>\ = <p2 = and for n > 3 

1 

n\ 



-(2 n+1 + n(n - 1) - n(n - l)(n - 2) - 4n - 4). 



One can then check that ^3 = ^4 = (^5 = < cj) n for n > 6. Thus -^H{u) > implying that i7 (u) 
is convex. □ 



7.2 The case m ~ n 



We will first prove Theorem [12] under the assumption that m = n and then in Sections 17.31 and 
17.41 we will extend the result to arbitrary m. We will as usual prove that Hall's Condition holds 
whp. We will therefore estimate the probability of the existence of sets A,B where |^4| = k and 
\B\ < k — 1 such that iVr(^4) Q B. Here N-p(S) is the set of neighbours of S in T. We call such 
a pair of sets, a witness to the non-existence of a perfect matching. There are two possibilities to 
consider: (i) A C L and B Q R or (ii) A C i? and B C L. We deal with both cases in order to 
help extend the results to m / n. We observe that if there exist a pair j4, B then there exist a 
minimal pair and in this case each b E B has at least two neighbours in A. We deal first with 
the existence probability for a witness in Case (i) and leave Case (ii) until Section 17.2.21 We then 
combine these results to finish the case m = n in Section 17.2.31 We will deal computationally 
with minimal witnesses where each vertex in B has at least 2 neighbours in A. If v has a unique 
neighbour w in A then A \ {w} , B \ {v} is also a witness. 



7.2.1 Case 1 



We estimate 
n L (k,£,D) = 

P(3A,B : \A\ =k,\B\=£< mm{k - 1, m/2} , N r (A) = B, d(B) = D,d A (b) > 2,6 e B) < 
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0(n 



l/2v 



n \ I m 



E n 



d b 

kj V t J dh\f(z) \xh 

Eb 6 [fl <%=£> 
E 6 ^[£] d b =dn-D 



v dfc-1 

) <">■ n sr 

7 i=0 



^ 1 



2<Xi,,6eM6=l 
\J2 b x b=kd J 



m 1 

e riz 



2<d b ,bg[e] b=k 
\E b d b =dn-D J 



U[n \kj\e) (dn)\ f(zr 



(\u kd ){e u - 1 - uA (\u dn - D }{e u - 1 - u) m -^ ([u D - kd ] 



0<y b ,be[t] 6=1 
VEi^-D-fcd / 



n , i/ 2 , /n\ /m\ (d(n - fc))! (kd)\z dn fjzf f(Ci) m - k+1 I 



D-dk 



kj\£j (dn)\ f(z) m z kd Q d 



(D - kd)\ 



< 



(43) 



(44) 



where £i = £(y) < z where y 



dn-D 
m—k+ 



j > 2 due to our minimum degree assumption for R. 



,i/ 2 ,M/ m \ - fc))! (kd)\z dn fjzf- 1 /(Ci) m - fc+1 (fc - l)°- dk , , 

K \k)\k-\) (dn)\ f{z) m z kd ( dn - D (D-kd)\ ~ { ' 



O 



777/ 



. . n—k 

,d f ir,\^r\ / „u \ D ~ dk 



(*)(*) * d /(Cl)^ f efe 



Putting fc = an and m = /3n and h(u) = u u {l — u) 1 u and x = d — y where 0<i<rf-2we 
obtain, after substituting (^) = O ^ fcl/2 *^ n ^ etc. 



Explanation of ([43]): Choose sets A, B in (£) (™) ways. Choose degrees db,b € i? with probability 
0(77 1 / 2 )p]™ 1 g^-y such that Y,beB d b = D , T,b<jtB d b = dn - D for some D > 2{l). Choose the 
degrees x a , a € A in the sub-graph induced by A U B. Having fixed the degree sequence, we swap 
to the configuration model. Choose the configuration points associated the x a , a G A in n<ieA (x ) 
ways. Assign these D choices of points points associated with A in D\ ways. Then multiply by the 
probability (fcd)! 11^=0 1 dn-i °^ a §i ven pairing of points in A. 

Explanation of (JSJ to g5|): If A(x) = E^=o a « x?1 where a n > for n > then a n < A(C)/C n 
for any positive £ and A{Q/C > n is minimised at £ satisfying £A\Q/A(Q = n. 

For the remainder of Section 17.21 we assume that 

77 < 777 < 77 + o(77 7/8 ). (47) 
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In which case we have 



(0-1 \ n—k 
f(Cl)— \ = e o(n?/*a) 



Thus(fl6l) becomes 



(_^) e ^, k(o)M »(|.p(^)T"'. (48) 



n L (k,£,D) =0 



Case 1.1: < k < (l - §) n. 

Observe (see (J3QD) that 

z d /(Ci) d d ff(d-x) 



/(z) 9{d) {d - x) d ~* 

where g(x) is as defined in Lemma [T3l 
It follows from (gSJ that 



log ( £ ibr) = jL 1 log(5(t)) ^ - 1 j log(1 + Ce ~ C) + 1)dt - (49) 



Now £e ^ < e 1 which implies that log(l + (e > £e ^/10. Also, 

C(*)_, c >2 2 _ c 



t eC - 1 - C + 2 C + 2' 

And so C < t < C + 2. Thus 

t-2\ x(2d-A-x) 



il (log(1+Ce " <) + 1)dt£ /l( 1 + w)*^ + 



20e a 



This implies that 9<y g^ < e x, 0(a;) where ^(aj) = e £d ( 2d 4 ^ and = Note that d — x > 2 

and so "i/K 2 -) 5; e - ^ -2 -^ 1 in the range of interest. Plugging this into the last parenthesis of (I48p 
gives 

n{k , t , D) = o ( J^) e*""")«,)» (»(.)« (^(^)^ (?)Ty (») 
This immediately yields 

n(l-2/d) dfc+nV 10 n(l-2/d) dfc+nVio / . 

45H = E E **(M,I>)< E E o(^)/ t (a)(^e^)= (l). (51) 

£<k=e L n D=dk £<k=e L n D=dk ^ ' 

We use the notation ^Ij^yjso that the reader can easily refer back to the equation giving its definition. 

We will work with D < klogn because it is easy to show that whp the maximum degree in T is 
o(logre). The bound for y^jj comes from (|50p . using the fact that h(a) is bounded away from 1 and 
x = o(l) in this summation. ^5]] is the first of several sums that together show the unlikelihood 
chance of a witness. We will display them as they become available and use them in Sections 17.2.31 
Eland El 
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(/ a \ x \ 1 — a 
d d ( d _^d-x (^r) J ^ n dSni) is maximized when x = ad, provided 

ad < d — 2 or k < n (l — |). This in turn gives 



< o f J_) e^ 778 ^*)™ (a^U - a)- 2 ^Y (52) 

The function pd(a) = a a ^ d ~ 2 \\ — a)^ 2 ^ a ^ is at most 1 and is log-convex in a on [0, 1 — ^]. Indeed, 
if L\(a) = log Pd(a) then 

' iL] d-2 + (d-2)loga + 21og(l-a) (53) 



da 

d 2 Li _ d - 2 - da 
da? a(l — a) 



(54) 



We have ^i(O) = and L'^(0) = — oo. It follows that for every K > there exists a constant 
EL(K,d) > such that 

Pd(a) < e- Ka for a < e L (K). (55) 

We let e L = e L (l,d). 

We can immediately write 

n 1 / 10 fclogn n 1 / 10 logn . . 

4a = E E " ) - E E (S) <=°""""" ) - °» «") 

£<k=2D=dk £<k=2D=dk V ' 

The bound for is derived from ()52[) using ip(x),p(a) < 1. 
Along the same lines we have 

E^n fclogn e^n fclogn , 

^2= E E E Eo(^) e - l(1 -'""" /, » = »(i). (5T> 

^<fc=n 1 /lO D=<ifc e<k=n i/lO D=dk V 7 

The bound for j^gy] comes from (f52"j) and ([53]) . 

Now (l - !) <C( ~ 2 ' ) ^ (I)~ 4 ^ decreases in d and is < 1 for d > 4. So if d > 4 then 

n(l-2/d) logn n(l-2/d) fclogn / , 

^m= E E mm^h E E o(^)^ r/ ' a V(n-^r W (a)» = o(i). 

i<k=e L n D=dk+n 1 / 10 t<k=e L n D=dk+n 1 / 10 

(58) 

The bound for ^ [5g] comes from ([52]) using the fact that pd(a) < e a and x > n 4 / 5 in this 
summation. 
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When d = 3 we need some extra calculations. First note that ps(.15) < 1 and so arguing as above 
we have 

.15n fclogn .15n logn , . . 

Am= E E «L{k,i,D)= y: E o( l ^)e^ 7/s Mn- 4/5 ) n P3(ar = o(i). 

£<fc=e L n D=3fc+n 1 /io t<k=e L n D=3fc+nV 10 ^ ' 

(59) 

Because we can choose any value for (i in the bound (I48j) we can simplify matters by choosing 
Ci = C independent of x to get 

™* "> ^ ° (^) (75, f (^)T" • («) 

Now 

^ V <exp(-^1 (61) 



(1 — a)x y [ 1 — a 

and so 



j ■ (62) 

Now the function ^(a) = h(a)e^ a (^y^ef^l is log-convex. Our choice of £ will be 1.5 and we 
note that with this choice L2(-15), ^(2/5) < .98 and so 



2n/5 fclogn 2n/5 fclogn , . 

: £ E *Uk,l,D)< £ E 0(^)e^)(.98r = (l). (63) 

£<fc=.15nD=3fc+n 1 /iO £<k=.15n D=3k+n l/W ^ n ' 

We have gone slightly beyond n/3 to 2n/5. It is convenient to repeat this idea for a couple of 
ranges. Putting £ = .5 we get £2(2/5), L2 (.74) < -995 from which we deuce that 

,74n fclogn .74n fclogn - . 

A mr E E ™<m,j»<; £ £ ofeW"^^^). 

^<fc=2n/5D=3fc+n 1 /iO ^<fc=2n/5 D=3fc+nVio V J 

(64) 

Putting £ = .2 we get L2(.74), L2(.87) < .995 from which we deuce that 

.87n fclogn .87n fclogn . . 

Js m= E E **(m,i>)< E E o(^)e^)(.995r = o( i). (6 b) 

^<fc=.74/j =3fc+n i/io ^<fc=.74D=3fc+nVio V J 

Case 1.2.1: (l - |) n < fc < (l - n. 

For a > 1 — § the maximising value for cc in (|50p is at x = d — 2 (recall that < x < d — 2), so 
plugging into ([50]) gives 



A; 



*L(k,£,D) = ^» <^( 



rW 5 
fc 



= o(n-V8 a)/) ^^_ 2 [ d d}_ (j-a 



d-2\ !- aN 



1/2 



e o(n- 1 /8 a ) a d-2 



2 2 Vd-2 / 
,/ \ l-a\ " 



(d - 2) d ~ 2 2 2 



(66) 
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Lot L,(o) - log ( a d ~ 2 ( {d _$- i2i y ^ Then 



d d ( ( d d 

" L 3(a) = — (d - 2) log a + (1 - a) log 



da ay ' da \ y ' 6 v y & V( d - 2 ) d ~ 222 
d - 2 

= d log d + (d - 2) log(d - 2) + 2 log 2 

a 

Assume for now that d > 6. Then the derivative with respect to d, for of the last expression is 

^- 1O8 (d 1 2) al - 1O8 (l) >0 

so it takes a minimum at d = 6 with a value 

4 

- -61og6 + 41og4 + 21og2 > 0.18 > 0. 

a 

Now £3(1) = and so for a > 1 — | and d > 6 we have 

n— n 7 / 8 fclogn n— n 7 / 8 fclogn , . 

£<k=n(l-2/d) D=dk £<k=n(l-2/d) D=dk V J 

(67) 

For d = 3, 4, 5 we use the following 
Claim 16 For y > 2 we have < §. 

Substituting this into (|48p gives 

, l(M , D) . (_L) ( Mo) -( e *-.)_g.|(fs)yy (68) 

7%) I ( ^ ~ J J i s taken when either x = for a € 
[1 - §, 1 - ^j] or at x = d - 2 for a € [1 - ^j, 1]. 

So for a € [1 — |, 1 — -t^t] we get 



1 \ / ^fv.-i/8„i. , ^,10 „ ( 3z 



l-a 



<2 \ 



The expression L4(a) = /t(a) d 2 e a ( g^y j is log-convex on [1 — |, 1 — ^rj] and L4 < .97 at both 
ends of the interval for both d = 3, 4, 5. We can therefore write 

n(l-l/(d-l)) fclogn n(l-l/(d-l)) fclogn , . 

^ E E«(M,0< E EofilV^W-^ (69) 

£<k=n(l-2/d) D=dk £<k=n{l-2/d) D=dk V 7 

for aG [1-1,1-^]. 
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Case 1.2.2: k > (l - n. 

For a e [1 - 1] from ([68]) we get 

Tl(M , D) .o(-i 5 )U--.K )-v( i ^ y ) (--^ ^ 



d-2 

1 \ / / o ci d—2 \ 1— a\ n 



The expression L^{a) = a d ~ 2 ^f^^-2) d ~ 2 ) * s l°g- concave on [1 ~~ gzjj !]■ The derivative of 
logLs at a = 1 is at least 1/100 for both d = 4, 5. Consequently for d = 4, 5 

n _ n 7/8 fclogn n— n 7 / 8 fclogn , . 

4ZQT E E < E E ° ( }2 m) e-(«- fc )/^- 1/8fc ) = 0(1). 



(70) 



£<fc=n(l-l/(d-l)) D=dfc £<k=n(l-2/d) D=dk v 7 

for a G [1 - 3^,1]. 
For d = 3 we go back to (|60l) and ([6Tj) and put £ = (1 — a)/a giving 

»> * o (^) (^ % ) ( 7 ^^^ ! f i ^) "T • (-) 



Now for x < 1 we have 



o 

X / X 



/(x) < T (1 + 3 



Plugging this into (|7ip for a > 1/2 and replacing 2a + 1 < 3 we have 



3 \ !- a 



n 



^W^Wf °°lW)J J ' (72> 

/ 3 \ 1— a 

The function Lq(cl) = a a ( 2 e f( z ) ) ^ s log-convex and Lq(.84) < .9995 and ^(1) = 1- Also, 
L' 6 (l) > 1/20. It follows that 

n— n 7 / 8 fclogn n— n 7 / 8 fclogn , , . . 

423 = E E MM,^) < E E 0(^^V min «"- fc ^^ 

e<k=.84n D=dk £<k=.84n D=dk ^ H ' 

(73) 

Proof of Claim [16} Recall from LemmaOthat C,(x) is concave and thus ^^_^ 2 ^ is decreasing. 
Since ("(2) = we have 

c(y) _ ay) - cm 

y-2 y-2 

d , . 

dy 

. e z — z — 1 
= lim 



. -6 (e z - l) 2 - 
3 
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Thus we have that < 3(y — 2) for all y. For y > 3 we can upper bound 

1^1 < M < e l < e l <l 

C{y) y ~ y y ~ yy ~ & ~ 4 

For y G [2, 3] we have that 3(y — 2) < y and so we can bound 

f(C(y)) /(3(y-2)) 
C(y) y " (3(1/ - 2))w 

Taking the logarithm of this expression and substituting u = 3(y — 2) we get 

log/(«) - ^- + 2J logu = u - - log(u) + (log/(u) - u - 21og(u)) = u - - log(u) + #(u) (74) 

where .ff(it) is from Lemma [T5l 
Since H (it) is convex we have 

gM < g (0) + u g ' 3 >- g '°» 

3 

Pluggin this into ([73]) we get 

it - H bg n + F(0) + ix g(3) ~ g(0) = (0) + J ((#(3) - (0) + 3)u - u log u) 
which is concave in u and takes a maximum value of H(0) + it/3 when 

it = exp(H(3) - H(0) + 3 - 1) = — — < 0.33 

18e 

Pluggin this back in we see that for x G [2, 3], which is u E [0, 3] we have 

< exp(ff(0) + .11) = exp(0.11)/2 < \ 

□ 



7.2.2 Case 2 

Now let us estimate the probability of a violation of Hall's condition with A C R. We once again 
begin with arbitrary m. Let 

TT R (k,£,D) = 

P(3A CR,B C L : = Jfe, |J3| = I < mm{k - l,n/2},N T (A) C B,d B (b) > 2, 6 G B,d R (A) = D 

°<» i/2 > (!)(") s n4nCJ D! n^^ < 75 > 

V K / W 2<d a ,ae[m] a=l flo - / ^ 6=1 W i=0 tfn * 

2<z t ,<d,&e[£] 

£a£[fc] d a =dn-D 

, n i/2) a^a f«\ zdn - D! ( dn - -°) ! 



kj\£jf(z) m (dn)\ 
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k 

e ni 



0(n 



m\ n\ z 



2<d a ,a£[k] o=l 
dn ^ 



4! 



n 

e n jt 

2<d a ,a^[fc] a=fc+l a 
\E a d a =dn-D 



e n 

2<x i) <d,&e[£] 6=1 
\ I2b x b=D 



Xb 



0(n 



x ([^](e U " 1 " «) fc ) ([u dn - D ](e u - 1 - u) m - fe ) ([u D ]((l + u) d - (1 + du)) 1 

l/2\ / m ^ / ; ' ^ ' 



< 



\kj\k - I J f{z) m (dn\ 



1 - u) 



[u dn - D ]{e u - 1 - u 



,m—k \ ( \„.D 



[u u ]{(l + u) a -{l + du)) K \ < 



n( v^( m \( n \ zdn 1 /(Ci) fc /(C2) m ~Y 
[ \kj\k-i) f(zrt^) cp ct~ D v 



where £i = C(D/k) and C2 = C 



' dn-D 



'dk 
D 



< 



m—k 



actually any value for £1 , £2 is valid 



(76) 



0(n 



1/2) f 



m\ n 



(my (f{C2)\ m - k (z_\ D (z- 

\kj\kj (%)\f{z)J \f(z)J VCJ VC2. 



o 







' Hea/dr ff(Ci)\ a rm)y- a (z_ 

nWj \ h{a)h(a/P)Ph(p/d)<* \ f(z) J \ f(z) J \(i 



h(9a/d) c 



z d /(Ci) a /(C 2 ) /3 - aV 



n 1 / 2 J\h(a)h(a/P)f } h(9/d) ad f(z)' 3 Ci a ( d ~ 9a 



(77) 
(78) 
(79) 



where a = k/n, m = (5n and D = 9k < dk. 

Explanation of (|7g]l : Choose sets A, S in (™) (™) ways. Choose degrees d a , a £ R with probability 

©(n 1 / 2 ) n«=i d 1/( z ) such that ^2 aeA d a = D, Yla^A^-a = dn — D for some D > 2k. Choose the 
degrees Xb,b € B in the sub-graph induced by AUB. Having fixed the degree sequence, swap to the 
configuration model [5]. Choose the configuration points associated with the Xb,b G B in Hfees CrJ 
ways. Then multiply by the probability D\ J^^q 1 ~rzr- of a given pairing of points in A. 

We assume that (|47p holds for the remainder of the section. In which case we have 



h{a) f(C2f~ l 
h{a/f3f f(z)^ 



Thus, (1791) becomes 



TT R (k,e,D) < O 



n 



1/2 



An -i/, a) K9a/d) d z d my m 



l-a 



h(a) 2 h(9/d) ad f(z) (f a ( d ~ ea 



(80) 



It follows from Lemma [13] that we can upper bound 



l-a 



d d g(ey g (^ 



9(d) 



l-a 



d-ad 
l-a 



d-a6 
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< g(ag+a-a)gf) d d 

9(d) n a e(d-ae^ d ~ 



1-a 



a a9 (l - a) d ~ 



a ad (l-a) d -'" 



Plugging this into ([50]) gives 



1 \ f P o(n 1 / 8 a) n a0(-t _ \d-a6\ n 

n R (k,l,D) < O ( * ) 6 - (si) 



Now let RU9) = log f a ° ° d ■ Then 

-R'i(^) = a log a — alog(l — a) — a log 6* + alog(d — 9). 

Thus R\(9) is concave and is maximized when 9 = ad. Because 9 > 2 we can only use this for 
a > 2/d. 

Case 2.1: k > 2n/d. 



7r R (k,£,D)<0 



O 



1 \/e°( n 1/8a )a a2d (l-a) d " a2d \ n 



V2 J [ h{a) 2+ad 



,o{n-V E a) a 2 d-a(2+ad) /-, _ \d-a 2 d-(l-a)(2+ad)\ n 



O f _M ^o(n-VBa) -2a (1 _ a) (d-2)(l-a)) n 



J_Vo(n^a) pd(1 _ a) n (g2) 



where the function is defined following (|52p . 
We find that 



(du 2\ {d ~ 2)2 V /d 

Pd (l-2/d) = I _ M - - J <.9ford>5. (83) 
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Now p d (l - 2/d) < 9/10 for d > 5 and p 4 (2.01/4) < .997. So, with the aid of (J55D , 

n— n 7 / 8 fclogn n— n 7 / 8 / i \ fclogn 

Hi= E £**(m,j>)< E K ( " 7/8a) E e " n7/8 /-^ 5 - ( 84 ) 

£<k=2n/d D=2k £<k=2n/d ^ ' D=2fc 

n _ n 7/8 ^ n _ n 7/8 ^ 

%a= E £*«(M,i>)< E o (i) * 0(n7/8a) E ^ 7/8 /- d = 4 - 

(85) 



£<fc=2.01n/4D=2fc ^<fc=2.01n/4 x 7 D=2fc 



We will treat d = 3 and /c > 2n/3 under Case 2.2. 
Case 2.2: 2 < k < 2n/d. 

In this case the expression in (|8ip (ignoring error terms) is maximized at 9 = 2. Then 

1 \ / e>o(n -1 / 8 a) 2a /i n\d— 2a 

7T«(fc,Z>)<0' W 

= o 



« 1/2 / V /i(a) 2 /i (f) ad J 

1 \ / e o(n" 1 / 8 a) ^ _ a W-2a-2(l-a) \ " 



n l/2 y ^ h ,T\od 

1 \ / e °("- 1/ M(i_ a )^2\ n 



1/2/1 , /2\ ad 



my 



Let i? 2 (a) = log [ (J 0y^]- Then 



i^( a ) = — — - -dlog/t(2/d) < ford>6. 
1 — a 

Thus i?2( a ) is strict concave and its maximum is taken at a = and i?2(°) < i?2(0)a for all 
a € [0, §]. Furthermore, i? 2 (0) < -3/10 for d > 6. It follows that if d > 6 then 

n 1 / 10 dk n 1/10 dk 

m = E E**(m,s)< E E !^)^" 178 "^^)- ( 86 ) 

£<k=2D=2k Kk=2D=2k V 7 

2n/d dk 2n/d dk / i \ 

^ = E E^(k,i,D)< ^ E (^) e-^- 1 ^ = o(l). (87) 
For d = 3, 4, 5 we use a better bound on [u D ]((l + u) d — 1 — du) k in ([7^1) . 
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Case 2.2a: d = 5. 



[u D ]((l + uf-l- 5u) k = [u D ](Wu 2 + Wu 2 + 5u 4 + u 5 ) k ) 

= [u D - 2k ](W + Wu + 5u 2 + u 3 ) fc 



2 10 



< lO k \u D - 2k ] 1 + 



10 



3fc \ 
fc \D-2k) 



Replacing the 1 d factor in (f8Tj) which comes from (^f) gives, for d=5, 
Hi) 



7T R (k,D)<0 



O 



k \ ( e°( n 1/8fit )a ae (l - af~ a6 ' 



nV 2 
k 



n 



1/2 



o 



nV 2 



10 



/i(a) 



o(n- 1 / 8 a)^Qa a a(9-2) n _ a ^-2-a(6-2) 



2 e -ih( e - 



-2\3\ a 



( 



( ( a X^ \ 3a \ 



e o(n 1/8 a) 10 «(i _ a ) 



V 



1-a 



2^0, (M) 

V 3 / 



Let p(x) = -^jpj for any x 6 [0, 1], note that if P(x) = logp(x) then 

P'{x) = log q — log x + log(l — x) 

P»( x ) =-- — < 

X 1 — X 

and so p(x) is maximized when log q = log ( J or x = and the maximum value is 1 + q 



Thus from ([88]) we get 



ir R (k,£,D)<0 



1/2 



3 o(n 1/8 «)io a (l-a) 3 (\ 



+ 



2(1 - a) 



3a x 



Let R 3 (a) = log ( 10 a (l - a) 3 f 1 + ) ) . Then 



3 a 



fl£(a) = log 10 - + 3 log ~ ~ " 



2 - a 



2 - 2a 



RUa) 



3a 



(2-a) 2 (l - a) 



> 0. 
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So R 3 (a) is log-convex on [0, §]. We have R 3 (0) = and 1?£(0) = log 10 - 3 < -3/4 and 12 3 (2/5) < 
-1/4. It follows that 



2n/5 5fc 
£<fc=2D=2fc 



^<fc=2D=2fc 

Case 2.2b: d = 4. 



= o(n _1 / 8 a) 



2n/5 5fc 



-(3/4+o(n- 7 / 8 ))fc 



o(l). 



u D ]((l + uY-l- Auf = [u u - 2k }{6 + 4u + u 2 ) 



k _ r„.D-2k] 



,2\k 



ikr.D-2ki 



= 6> 
<Q k \u D - 2k ] fl + 



4 it 

1 + ~iH 

6 6 



2\ k 



6 



2fc \ 
k \D-2k) 
2-D-2A; • 



7r R (k,£,D)<0 



O 



O 



1 



nV2 
1 



1/2 



1/2 



O o(n x / & a) a a9 — Q,) 4 ~ ae \ " 



2 e -2/i 



2a N 



; o(n- 1 /8 a ) 6 a a a(e-2)^ 1 _ ^2-a(0-2) 



o(n 1/8 a)ga^ 



V 



1-a 



6-2 \ 2a N « 



\ 



\ 



< (rT7^) (e°("- 1/Sa )6 a (l-a) 2 (l + 



2(1 - a) 



2a > 



Now if 124(a) = log ^6 a (l - a) 2 (l + ^r^y)^ then 



" (2 -„) "(!-<■) > »• 



Thus 12 4 is log-convex on [0, \}. We have 12 4 (0) = 1 and 12 4 (0) = log 6-2 < -1/5 and 12 4 (2.01/4) < 
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-1/20. It follows from this and ([55]) that 



2.01n/4fclogn 

%n= E E **(m,i>)< 

£<fc=2 D=2fc 
n 1 / 10 4fc 



£<fc=2D=2fc 

Case 2.2c: d = 3. 



t -<- <±re / i \ 2.01n/4 4fc / -, \ 

E E o (^) e°(»-"-"' + Y. E» i 



3 -(l/5+o(n 1 / 8 ))fc 



^]((1 + u ) d - 1 - 3u) fc = [u^ 2fc ](3 + u)> 

j3fe-D/ & 

£> - 2k 



o(l) 



n R (k,£,D) <0 
= O 

= O 

< o 



1 



n 1 / 2 
1 

1 

U^J 2 

1 



h(0-2) 

3 3 " 



^ e o(n-V8 a)aa(e _ 2)(1 _ o) l-a(e-2) f ^ 

Ok 



(fa ^ ° S " 



a o(n 1 / 8 a)qa 



3 a (l - a) 



h(0 - 2) 



3 o(n 1 / 8 a)oO 



3 a (l - a) 1 + 



3(1 -a) 



3 , /3-2a 

h log 

3 - 2a & V 1 - a 

4a - 3 



(3 - 2a) 2 (l - a)' 



Now if Ife (a) = log [3 a (l - a) (1 + 3^ J j then 

4(a) 
12 5 '(a) 

Case 2.2c(i): .51 < a < 1. 

Thus -R5 is log-concave on |] and log-convex on [|, 1]. We have 12s(l/2) = 0, 12' 5 (l/2) < 
and 12 5 (3/4) < -.04 and 12 5 (1) = and 12^(1) = 00. It follows that 

n-n 7 / 8 3k 

m= E E**(m,j>)< 

l<k=.51nD=2k 

3n/4 3k / 1 \ n-n 7 / 8 3k 

E E» iV^ MW *^+ E E=-"" 8 = °w 

£<fc=.51n D=2fc ^ ' £<k=3n/4D=2k 

Now let us consider < < .51ra. 

Case 2.2c(ii): < a < .51. 

(a) 6» > 2.0005 and < a <. 
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We go back to ([80]) and make the choice £1 = C2 = z and replace h(9/d)~ by ( h(8-2) ) an( ^ 
consider the function 

so that ir R (k,e,D) < O (-^\ Fi(0,ct) n . Let (?i(0,a) = log(Fi(6», a)). Then 

9^ /27(l-af(M^\ 

~ lo S 0^2.0 fl x3-fl/o ImT ( 92 ) 



89 V V 9 / V 3 

d 2 Gi _ (3-a)fl-6 

9a 2 ~ a(l-a)(3-a0) 

9 2 Gi _ ((3 - a)# 2 - 120 + 18)a 



9a fa V 3 e a 2 (3 - 9f- 9 (3 - a9) e 

dGl a (log - log(6> - 2) + log(a(9) - log f 1 - ^) ) (93) 

(94) 
(95) 



d9 2 9(3 - a9)(9 - 3)(9 - 2) 
It follows from (El) that 



2^ g 

G*i(#, a) is a convex function of a for < a < a# = — - — , for 9 fixed, 2 < 9 < 3 (96) 

9 

and 

Gi(0, a) is a concave function of a for a# < a < 1, for 9 fixed, 2 < 9 < 3. (97) 
It follows from (1951) that 



Gi(9, a) is a concave function of 9 on [2, 3] for a fixed, < a < 1. (98) 
A calculation shows that if g\(9) = Gi(9,ag) then 

^)-T^{w^w) (99) 

^)^-^ y (-^2(3-,)log(^)). (100) 



Furthermore, if g 2 (9) = ^ | 0=ae then 



92(9) = log ( ¥ ^-^). (101) 



(a) 2.0005 <6<<3andO<a< e" 10000 . 

For a < e - 10000 we have ^ < log 10 - (6 - 2) log 1/a < -2. So, 



Fi(0,a) < e~ 2a for < a < e - 10000 , 2.00 05 < 9 < 3. (102) 



(b) 2.46 < 6> < 3 and e" 10000 < a < .51. 

Now a e > .51 for 9 > 2.46 and so ([96]) implies that Gi(9,a) < max {Gi(0, e" 10000 ), d(9, .51)} for 
2.46 < < 3 and e" 10000 < a < .51. Now ([T02|) implies that d(2.46, e" 10000 ) < -2 e - 10000 and 
(pi) implies that ^ le= 2 .46,a= e -ioooo< and so (J98|) implies that Gi(0, e~ 10000 ) < -2 e - 10000 for 
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2.46 < < 3. Also, by direct calculation, we have G\ (2.46, .51) < -.002 and ^ |e=2.46,a=.5i< 
and so G x {9, .51) < -.002 for 2.46 < < 3. Thus, 

Fx(0,a) < e - 2e ~ 10000 for e' 10000 < a < .51 and 2.46 < < 3. 



(c) 2.0005 < < 2.25 and e" 1000 < a < .51. 
We take Ci = -6 and C2 = 2.1 in ([SD]) and let 



F 2 (0,a) = F 1 (g,a)^ /( ^ / ^; B = F l( 0,a)e^a+^ 

where 

oP2 _ ^ 3 /(C 2 ) ™ _ /(Cl) _ C: 



2 



pr^ — ' ' p ^ — — ' : f> ^ — 



Let G 2 (0, a) = log(F 2 (0, a)). ^ = ^ and ^ = ^ and so and ® hold with d 

replaced by G 2 . Putting 72 (0) = G 2 (9,a e ) we see that 7 2 '(0) = ^'(0) - > 0, using (fTOOll (a 2 < 
-3.127). Thus 7 2 is convex on 2.0005 < 6 < 2.25. Furthermore 7 2 (2.0005), 7 2 (2.25) < -.00003 and 
so 72 (0) < -.00003 for G [2.0005,2.25] and therefore G 2 {9,a) < -.00003a/a e < -.00003a when 
< a < a e and 9 G [2.0005, 2.25]. Next let <£ 2 (0) = ^ \ a =a e - We have 2 (0) = 5-2(6*) + a 2 + t 2 9 < 
-.05 for 2.0005 < 9 < 2.25, using (fTUTD (r 2 < 1.253). So, G 2 (0,a) < </> 2 (0) - .05(a - a e ) for a > a e 
when G [2.0005,2.25]. Thus 

F 2 (0,a) < e -' 00003a for e" 1000 < a < .51 and 2.0005 < 9 < 2.25. 



Now suppose that we repeat the idea of the previous paragraph, but this time we take (j = 
1.4 and ( 2 = 3 in ([80]) and use the same notation. Putting 7 2 (0) = G 2 (9,ag) we see that 
t£'(0) = g'{(9) - ±§f^ > 0, using CEED (cr 2 < -2.27). Thus 72 is convex on 2.25 < 9 < 2.46. 
Furthermore 72 (2.25), 72 (2.46) < -.05 and so 72 (0) < -.05 for G [2.25,2.46] and therefore 
G 2 (9,a) < -.05a/a e < -.05a when < a < a e and G [2.25,2.46]. Next let 2 (0) = ^ | a=ae . 
We have c/> 2 (0) = c/ 2 (0) + a 2 + t 2 9 < -.2 for 2.25 < < 2.46, using (fTOTT) (r 2 < .763). So, 
G 2 (0, a) < 2 (0) - .2(a - a e ) for a > a e when G [2.25, 2.46]. Thus 

F 2 (0,a) < e~ 05a for e~ 1000 < a < .51 and 2.25 < < 2.46. 



(d) 2 < < 2.0005 and e" 1000 < a < .51. 

For this we simplify our estimate of TTn(k, I, D) by removing some terms involving (3 from (|79p . 



Tr R (k,£,D) < P(3A CR,BCL: \A\ = k,\B\ = k- l,N r (A) C B,d B (b) >2,6gB)< 



0(n 



1/2n 



n 



k \k-l 



E 



2<c( a ,ag[m] 
2<z 5 <d,be[fc-l] 

EaS[fc] rf a=Ei,£[fc_l] S&=.D 



n it) »■ n 



i=0 



dn — i 



O 



m 1 



1 / 2 J \k 



n\ m 



\z D D\ (dn-D)\ 
jJW (dn)\ 



( \ 

k 

e 

2<d a ,aG[fc] a=l 



^-1 



e n 

2<x b <d,be[k-l] b=l 
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m 1 / 2 J \kj\kj f(z) k (<£) 

k \ fn\ fm\ z D 1 /(Ci) fc fdk 
ntf 2 ) \k)\k) f{z)* (£) Cf U 

fc \ //A/.A (*) //(Ci)^ ' - ^ 



O (z4 f - 1 - n) fc ) ( [„"]((! + „)- - (1 + ) < 



O 

o 



o 



m 1 / 2 J \kj\kj ( d g) \f(z) J VCi. 

1 \ / h(9a/d) d //(Ci) z 6 



mV2 ; \h(a)h(a/f3)Ph(6/d) ad \ (£ f(z) 



Now let 



'.mVV V h(afh(9/d) ad \ $ f(z)> 1 ' 

M^/3) 3 3( 3 -^ f /(Ci) ^ 

^3 If j " 



h(o)27»(0-2)« V C? /(*) 
We take Ci = -0001 and then 

/(CO " 



< .e 



Cf /(*) 

for 2 < < 2.0005. Keeping some slack, we define 

h(ea/3) 3 3^ a e 



.85a 



h(a) 2 h{9 - 2) a 

and G 4 (0,a) = log(F(0,a)). Now let 74(e) = G 4 (9,a e ). We have 7 4 (0) = g[[9) - and 7 4 '(0) = 
9i(0) + ^ and we find from (fTOOl) that 74 is concave on 2 < 9 < 2.0005. Furthermore 74(2) = 
and using fl99]) we see that 7 4 (2) < -.8 and so g x {9) < -.8(6 - 2) for 9 G [2, 2.0005]. So G 4 (0, a) < 
-.8a(9-2)/a e < -.8a(0-2) for < a < a e . Next let <f> A (6) = ^ |a=a 9 = 5 2 (6 , )--85. We see from 
(IT0T]) that # 2 (0) < --5 for 2 < < 2.0005 and thus G 4 (0,a) < --5(a - a fl ) for a > a e when e 
[2, 2.0005]. Replacing e"- 85 by e"- 86 in the definition of F 4 (0, a) we get F 4 (0, a) < e -«0~2)a/5+a/loO) 
for < a < .51 when 2 < < 2.0005. So, for some small constant c > 0, 

.51n 3fc .51n 3fc 

i to= E X>«(m,.d)< E E e ^ = °( 1 )- ( 104 ) 

Kk=2D=2k e<k=2D=2k 

7.2.3 Finishing the case m ~ n 

We repeat our observation that the maximum degree A in T is o(logn) whp. Therefore 
Case 1: m > n. 

^+^+^57J+^[58]+^[69]+%l+^89] d = 5 

^M\+ ^M\+ AS2\ + Am + ^(691+^81+ d = 4 

^^m\+ d = 3 

where the o(l) term accounts for P(A(r) > logn). We use ^8^1+^861+^871^° account for witnesses 
A C L, B with \A\>n- n 7 / 8 . This is because if A' = R \ B and B' = L \ A then \A'\ = m — k + 1 
and l-B'l = n — k and iVr(-A') C I?' and there will be a minimal witness A", B" with A" C A'. 



P(/i(r) < n) < o(l)+< 
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Case 2: m < n. 



PQu(r) < m) < o(l) + < 



^81+ ^89]+ ^57] d = 5 

.^Il+^IOl+^+^fc] d = 3 



We point out for use in the next section that our computations allow us to claim that we have 



n— n 3 / 4 fclogn 



52 52 K L (k,e,D) = 0(c 



-C(n 3 / 4 )> 



(105) 



k=n 3/4 D=dk 
£<min{fe-l,m/2} 



Our computations also allow us to claim that 

n-n 3 / 4 dk 



52 52 £,D)= 0(e 



-H(n 3 / 4 ) 



(106) 



k=n 3/4 D=2k 
l<min{k-l,n/2} 



7.3 The case m > n + n^ 5 

Let Q(n,m) denote the set of bipartite graphs with \L\ = n, \R\ = m that are d-regular on L and 
degree at least 2 on L. Here n + n 4//5 < m < dn/2. In fact suppose first that m < ^dn where 
^ < 1/2 is a constant. Suppose that G(n,m) is chosen uniformly at random from G(n,m). 

If there is no matching from L to R, then let a minimal witness A, B be small if \A\ < n 3 / 4 and 
large if \A\ > n — n 3//4 and medium otherwise. 



7.3.1 Small/Large Witnesses 

We go back to (gSD. We see that /(Ci) < f(z) implies that the term — z ^_ a J ^V_7" ( ) is 

f( Z )T=a d V 7 

maximised over /? > 1 when = 1. Next let -ff(/3) = j3 log h(a/(3) then H'(j3) = log(l — a//3) and 
H"{(3) = ptp_ a \ ■ Thus /i(a//3)^ is log-convex in /3 and so 

^(a//?)^ > exp {#(1) + #'(1)03 - 1)} = h{a)(l - a) P ~ h (107) 
Going back to (f52j) we see that now we have 

By taking £l(/3) in place of £l(1) we can take if = /? in (|55|) and plugging this into (|108p we see 
that 

n 3 / 4 fclogn / 1 \ / p-/ 3 " \ n 

£ E^(M,^)<0 ^ =o(l). (109) 

e<k=2D=dk v 7 vv ; 7 
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To deal with k > n — n 3//4 we treat this as k < ra 3 / 4 in Section 17.2.21 Indeed, if there is such a 
witness A,B, let A' = R \ B and B' = L \ A. Then N r (A') C 5' and < and so we can 
find a witness A", 5" with A" C A', 5" C S' and < n 3 / 4 . 

We use ()103p for this calculation. Now 

« (f) ea (l-^)^ 



h(a)h(a/^h(9/d) ad fla(1 _ a)1 _ a jVj« ^ _ a_y- a ^ _ ^da-9a 



< !> '- 2) " cxp ^, ( 1 + 1 - (d - 0)log(l - 0/d) - j + 0(Yr) } < I 1.0) 



So from (11031) we can write 



an 



l,D) < O ( JL.) ( (^) exp { 1 + i - (d - (?) log(l - 0/d) - 9 + 0(a) } 

(111) 

Now we claim that 

Ci~ 2 > — and that f(x)x~ 2 is monotone increasing in x. (H2) 



First notice that f(x)x 2 = TT~ which is clearly monotone increasing. Second note that 



Ci = C(0) an d since — j-^ > we have 



Km <iCM = lim /(C(,)) 2 



:r->oo dx x->oo ( e CO) _ 1)2 _ ^(a;)2 e C(x) 

= Um = 1 

^ ( e C _ 1)2 _ C 2 e C 

and since C( x ) i s concave we have > 1. This, along with lim a ._ >2 - CO*-) = 0, implies that 

£(x) > x — 2. We can then lower bound 

Cf- 2 = ((Of- 2 > (0 - 2) 9 - 2 > e- e ^ > 0.69 
Using this we see from (jllip that if 

> 9 = 2 + 1 



log(l/az) 

then 



VR (k,l,D)<o(-^) e 



In which case we have 

,3/4 



E E^^^)^°(^) e_fe =°«- (113) 



l<k=2 9>9 a 
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When 9 < 9 we have 9 = 2 + o(l), /(&)/Ci = V 2 + o(l). Therefore 

/ 1 \ / z 2 -(d-2)log(l-2/d)+o(l)\ fc 

m*. ',»)<"y ( — ^ — j ■ 

Now for d > 4 we have 

2 -(d-2)log(l-2/d)+o(l) q 

m to (115> 

and so 

n 3/4 / s k 

fc=i e<e v / \ / 

When (i = 3, the expression on the LHS of (|115p is at most 1.26. So in this case we go back to 
(|103p and replace j^gj^a by j = e o(a) . After this (|110p is replaced by 

a^ 2 > a exp | a (l + ~ + 9 \og{9/d) - 9^j + o(a) 
And then (|114p is replaced by 

/ 1 \ / z 2 -21og(3/2)+o(l)\ fc / ] \ 1 



m 1 



and so 

„3/4 



/ 1 \ 1 

£ J2 Mk,i,D) < l \ = (1). (117) 

k=ie<e \m i J i 

7.3.2 Medium Witnesses 

Let di(n,m) denote the number of /^-vertices of degree i > 2 in G(n,m) and let Di(n,m) = 
E(di(n,m)). 

We define three events: 

Ai(n,m- 1) = |G G g(n,m- 1) : 3i : |dj(ra,m - 1) - Di(n,m- 1)| > n 3/5 /i 3 ,2 < i < log 2 n| 

(118) 

*4 2 («, m - 1) = {G G £(n, m - 1) : 3i : d»(n, m - 1) ^ 0, i > log 2 n} (119) 
B(n,m) = |(? G Q(n,m) : \d 2 (n, m) - D 2 (n, m)\ > 2n 3/5 } (120) 

We argue next that if A(n, m) = Ai(n, m — 1) U .42 (n, m) then 

P(i(n,m)u8(«,ra)) = e- !! ( loE2 "'. (121) 

For any t > we have 

P(|dj(n, m — 1) — D,i(n, in — 

1)1 > *) < 0(ji 1/2 )P(Bm(ii, ft ) > t) 
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where Qi = 

We will now use the following bounds (see for example pQ) 

P(\Bin(n,p) -np\ > t) < 2e~* 2/n , (122) 
P(Bin(n,p) > anp) < (e/a) anp . (123) 

If i < log 2 n then we can use (|122p with t = n 3 / 5 /z 3 to deal with Ai(n, m) and also with B(n, m). If 
i > log 2 n then nqi < e _r2 ( log n ). We can therefore use ()123p with a = 1/nqi to deal with A2(n, m). 
This concludes the proof of (|121l) . 

Now consider a set of pairs X C G(n, m — 1) X £?(n, m). We place (Gi, G 2 ) into X if G 2 is obtained 
from Gi in the following manner: Choose a vertex x G i? of degree at least four in G\. Suppose that 
its neighbours are yt, i = 1,2, ... ,k in any order. To create G2 we (i) replace x by two vertices x 
and m and then (ii) let the neighbours of x in G2 be 2/1,2/2 and let the neighbours of m be 7/3, . . . , y^. 



For G e Q*(n,m - 1) let 
and for G G £/* (n, m) let 
We note that if 

then 



tq(G) = |{G 2 : (G,G 2 ) GX}| 
vr 2 (G) = |{Gi : (G X ,G) GX}|. 

E 1 = ^h%m-1) 

i>4 ^ ' 



• G i A(n,m- 1) implies that |tti(G) - S x | < 0(n 3 / 5 ). 

• 7ri(G) < C 71 ^ 1 ) for all G G £(n,m- 1). 

• G ^ B(n,m) implies that |7r 2 (G) — -D 2 (n,m)| < n 3//5 . 

• 7r 2 (G) < m for all G G Q(n,m). 

We then note that 

(Si -0(n 3 / 5 ))|£(n,m- 1)| < \X\ < (D 2 (n,m) + n 3/5 + me-^ 2 n) )\G{n,m)\. 

Now let V, Q be properties such that if (Gi,G 2 ) G X and G 2 G Q then Gi G "P. Let (Gi,G 2 ) be 
chosen uniformly from X and let Px denote probabilities computed w.r.t. this choice. Then 



and 



So, 



Px(G 2 G Q) > 



|X| 

(|Q| - |jB(w,m)|)(D 2 (ra,m) - n 3 / 5 ) 
|JSf I 



(|Q| - \B(n,m)\)(D 2 (n,m) - n 3 / 5 ) |7?| (Si + 0(n 3 / 5 )) + m|^(n, m — 1) 



|a(n,m)|(L> 2 (n,m) +n 3 / 5 + me- n ( lo § 2n )) \G(n, m - l)|(£i - 0(n 3 / 5 )) 
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So, 



\Q\ -2/5 V \ V \ 



< (l + 0(n-^ b ))- 



\Q(n,m)\ \g(n,m 
So, if Vj is a property of G(n,j) for j = n, n + 1, . . . , m, 



, <(l + 0(n- z ^)) m - n —^^—. (124) 
g(n,m) ~ v v n g(n,n + n 4 / 5 ) y ' 

We use (I124D in the following way: First let n + n 4 / 5 < j < m be the property that G G g(n,j) 
contains a minimal witness A, B with A C L,n 3 / 4 < |A| < n/2. If (Gi,G 2 ) G X and G 2 € £> m +i 
then Gi G £? m . Indeed i,Bfl [m] is a witness in Gi. Applying (|124p and (|105|) we see that whp 
B m fails to occur. Now let Bj be the property that G G G(n,j) contains a minimal witness A,B 
with A C i?,n 3 / 4 < \A\, \B\ < min{|,4| - (j -n),n/2}. If (Gi,G 2 ) G X and G 2 has a witness A 5 
with ACL and n/2 < |A| < n - n 3 / 4 then G 2 G B' m . Indeed A' = R \ A, B' = L \ B is also a 
witness in G 2 . Now if G 2 G £>^ with a witness A', £?' then A' n [m], B' is a witness in Gi and so 
contains a minimal witness A",B" where \A"\ > \B"\ + m — n > n 3//4 i.e. G\ G B' m _ 1 . Applying 
(11241) and (|106f) we see that whp B' m fails to occur. This deals with medium witnesses. 

It only remains to consider m close to dn/2 i.e. where £ defined at the beginning of this section is 
close 1/2. Observe first that the number of edges incident with vertices of degree greater than two 
is at most 3dn(l — 2£). If there are di vertices of degree i = 2, > 3 then <i 2 + d% = m = £dn and 
2<i 2 + 3c?3 < dn which implies that d 2 > dn (3^ — 1). So the number of edges incident with vertices 
of degree greater than two is at most dn — 2dn(3£ — 1). 

Now consider a witness A, B where \A\ = jn. We must have "fdn < 2 r yn + 3dn(l — 2( t ) which implies 
that 7 < 3d ^Z2^ which can be made arbitrarily small. Now the estimate in (|108p will suffice up to 
k < £L n and so we only need to make £ close enough to 1/2 so that 7 < el (which depends only 
on d and not 7). 

7.4 The case m < n — n 4 / 5 

We once again consider medium witnesses separately from small or large witnesses. 
7.4.1 Small/Large Witnesses 

We first go back to (|103j) and deal with irji(k,£, D) for k < n 3//4 as we did in Section 17.31 For 
k > n — n 3 / 4 we deal with 7tl(/c, £, D) for k < n 3//4 . We will go back to (|46h and write 

„ / 1— 8\ n—k 

Now x = d ^ m n ^^k+i dk ^ +1 > implies that 

l-P< D - dk + 1 = (^-) and that x = 0^ l0gn 



n \ n l \ n 



Also, (1 = C(d — x) implies that /(Ci) = /( z )(l ~~ O(x)). Therefore, 



1-0 

0(a 2 log 2 n) 



f(z) \ 1 « , „.,-' 



e 
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Arguing as for (|108j) we get 

Taking p{a) < e~ a as in (|57p and noting that 6 < 1 here we get Thus 

n 3 / 4 fc log n n 3 / 4 fc log n 



e e ^(m,^<e e o (^j (e-^^y=od). ^ 

k=l D=dk k=l D=dk V J 

7.4.2 Medium Witnesses 

Now consider a set of pairs Y C (?(n, m) x t/(n + 1, m). We place (Gi, G 2 ) into y if G 2 is obtained 
from G\ in the following manner: Choose < k < n. Replace edges (£, y) by + 1, y) for all ^ > k 
and all y. Add vertex k + 1 and d edges (A; + 1, y,), j = 1, 2, . . . , d. 

Note that if (Gi, G 2 ) G 1" and Gi has a matching of R into L then so does G 2 . 

For G G Q(n,m) let now 

7Ti(G) = I {G 2 : (G, G 2 ) G y} I 

and for G G <5 (n + 1, m) let 

7r 2 (G) = |{G 1 :(G 1 ,G)GF}|. 

Let 

S2 = (" + < 1 -^)) i 

and for G G <5(n + 1, m) let 

Ls{G) = \{v & L : all neighbours of « have degree at least 3}| . 

Let 

C(n + 1, m) = {G G a(n + 1, m) : |L 3 (G) - £ 2 | < n 3 / 5 } . 

Let 

We note that 

• G G G(n,m) implies that 711(G) = (n + 1)(™). 

• G ^ C(n,m + 1) implies that jvr 2 (G) - E 2 | < n 3/5 . 

• vr 2 (G) < n + 1 for all G G £(n + 1, m). 

We then note that 



\ Y \ > 

- (n + 1) ' 



\Q(n,m)\ \d 

S 2 -n 3 / 5 < — |y ) <S 2 + n 3 / 5 + (n + l)e- n ( log2n ). 

\G{n + l,m)\ 
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Now let V, Q be properties such that if (Gi,G 2 ) G Y and G 2 G Q then Gi G P. Let (Gi,G 2 ) 
be chosen uniformly from Y and let Py denote probabilities computed with respect to this choice. 
Then 

Py(G 2 G C) < P Y (G 1 G V) = ™ n + Vffl 

and 

Py(G 2 G Q) > (\Q\-\C(n+lMm 2 -n^) 
Arguing as in Section [731 we see that if Vj is a property of Q(J, m) for j = m, m + 1, . . . , n, 

J^L < (1 + 0(n -2/5 )r - m Jgl (126) 

C/(n, mj i/(m + n 4 / 5 , to) 

First let Bj,m + n 4 / 5 < j < n be the property that G G Q{j,m) contains a minimal witness 
AS with ,4 C #,n 3 / 4 < \A\ < to/2. If (G 1 ,G 2 ) G X and G 2 G B n+l then Gi G B n . Indeed 
j4, i? n [n] is a witness in Gi. Applying (|126p and (|106j) we see that whp ^ n fails to occur. Now 
let Bj be the property that G G Q(j,m) contains a minimal witness A,B with A C i?, n 3 / 4 < 
A|, |B| < min{AI - (j - m),m/2}. If (Gi,G 2 ) G X and G 2 has a witness A, 5 with A C i? and 
m/2 < Al < 771 - ™ 3/4 tnen G 2 G B^. Indeed A' = L \ A, B' = R \ B is also a witness in G 2 . Now 
if G 2 G B' m with a witness A',B' then A n [m],-B' is a witness in Gi and so contains a minimal 
witness A", B" where |A'I > \B"\ + n - m > n 3 / 4 i.e. Gi G Applying (fT26|) and ^05]) we 

see that whp B' m fails to occur. This deals with medium witnesses. 
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